Redundant data forwarding storage

ABSTRACT

Methods and apparatus, including computer program products, for redundant data forwarding are described. In one respect, the method includes intermittently forwarding the portion of the data among the first memory and memories of other nodes in the first network without storing the portion of data on any physical storage device of the interconnected nodes in the first network. The method may also include intermittently forwarding the first copy of the portion of the data among the second memory and memories of other nodes in the second network without storing the first copy of the portion of the data on any physical storage device of the interconnected nodes in the second network.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 12/052,345, filed Mar. 20, 2008, entitled “Redundant Data Forwarding Storage,” herein incorporated by reference in its entirety. Any and all priority claims identified in the Application Data Sheet, or any correction thereto, are hereby incorporated by reference under 37 C.F.R. §1.57.

FIELD AND BACKGROUND

At least some embodiments disclosed herein relate to data storage, and more particularly, to redundant data forwarding storage.

The volume of data that must be stored by individuals, organizations, businesses and government is growing every year. In addition to just keeping up with demand, organizations face other storage challenges. With the move to on-line, real-time business and government, critical data must be protected from loss or inaccessibility due to software or hardware failure. Today, many storage products do not provide complete failure protection and expose users to the risk of data loss or unavailability. For example, many storage solutions on the market today offer protection against some failure modes, such as processor failure, but not against others, such as disk drive failure. Many organizations are exposed to the risk of data loss or data unavailability due to component failure in their data storage system.

The data storage market is typically divided into two major segments, i.e., Direct Attached Storage (DAS) and Network Storage. DAS includes disks connected directly to a server.

Network Storage includes disks that are attached to a network rather than a specific server and can then be accessed and shared by other devices and applications on that network. Network Storage is typically divided into two segments, i.e., Storage Area Networks (SANs) and Network Attached Storage (NAS).

A SAN is a high-speed special-purpose network (or subnetwork) that interconnects different kinds of data storage devices with associated data servers on behalf of a larger network of users. Typically, a SAN is part of the overall network of computing resources for an enterprise. A storage area network is usually clustered in close proximity to other computing resources but may also extend to remote locations for backup and archival storage, using wide area (WAN) network carrier technologies.

NAS is hard disk storage that is set up with its own network address rather than being attached to the local computer that is serving applications to a network's workstation users. By removing storage access and its management from the local server, both application programming and files can be served faster because they are not competing for the same processor resources. The NAS is attached to a local area network (typically, an Ethernet network) and assigned an IP address. File requests are mapped by the main server to the NAS file server.

All of the above share one common feature that can be an Achilles tendon in more ways than one, i.e., data is stored on a physical medium, such as a disk drive, CD drive, and so forth.

SUMMARY OF THE DESCRIPTION

The present invention provides methods and apparatus, including computer program products, for redundant data forwarding.

In general, in one aspect, the invention features a method including, in two or more networks of interconnected computer system nodes, receiving a request from a source system in a first network to store data, directing the data to a first computer memory in a first network, directing a first copy of the data to a second computer memory in a second network, continuously forwarding the data from the first computer memory to other computer memories in the first network without storing on any physical storage device in the first network, and continuously forwarding the first copy of the data from the second computer memory to other computer memories in the second network without storing on any physical storage device in the second network.

In another aspect, the invention features a system including, at least two networks wherein computer system nodes are each adapted to receive data and copies of data and continuously forward the data and copies of data from computer memory to computer memory without storing on any physical storage device in response to a request to store data from a requesting system.

The details of one or more implementations of the invention are set forth in the accompanying drawings and the description below. Further features, aspects, and advantages of the invention will become apparent from the description, the drawings, and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The embodiments are illustrated by way of example and not limitation in the FIGS. of the accompanying drawings in which like references indicate similar elements.

FIG. 1 is a block diagram of an exemplary system.

FIG. 2 is a block diagram of an exemplary user system.

FIG. 3 is a block diagram of an exemplary network system.

FIG. 4 is a flow diagram of a process.

FIG. 5 is a flow diagram of a process.

DETAILED DESCRIPTION

Unlike peer to peer networks, which use data forwarding in a transient fashion so that data is eventually stored on a physical medium such as a disk drive, the present invention is a continuous redundant data forwarding system, i.e., data and copies of data are stored by continually forwarding it from one node memory to another node memory. Copies of data may continuously forwarded in one or more networks.

As shown in FIG. 1, an exemplary system 10 includes a user system 12 and a number of network systems 14, 16, 18, 20, 22. Each of the network systems 14, 16, 18, 20, 22 can be considered to be a node in the system 10 and one such network system may be designated as a central server, such as network system 14, which may assume a control position in system 10. Each of the nodes 14, 16, 18, 20, 22 may be established as a privately controlled network of peers under direct control of the central server 14. Peered nodes may also be a mix of private and public nodes, and thus not under the direct physical control of the central server 14. The system 10 may also be wholly public where the central server 14 (or servers) has no direct ownership or direct physical control of any of the peered nodes.

In one example, nodes 14, 16, 18, 20 and 22 can be considered a private network. In a private network, an administrator controls the nodes and may designate which node is the central server. The system 10 can also include one or more additional nodes. For example, nodes 24, 26 and 28. These nodes 24, 26 and 28 may be considered to be part of one or more public networks in which the administrator has little or no control.

As shown in FIG. 2, the user system 12 can include a processor 30, memory 32 and input/output (I/O) device 34. Memory 32 can include an operating system (OS) 36, such as Linux, Apple® OS or Windows®, one or more application processes 38, and a storage process 100, explained in detail below. Application processes 38 can include user productivity software, such as OpenOffice or Microsoft® Office. The I/O device 34 can include a graphical user interface (GUI) 40 for display to a user 42.

As shown in FIG. 3, each of the network systems, such as network system 14, can include a processor 50 and memory 52. Memory 52 can include an OS 54, such as Linux, Apple® OS or Windows®, and a data forwarding process 200, explained in detail below.

In traditional systems, application processes 38 need to store and retrieve data. In these traditional systems, data is stored on local or remote physical devices, and copies of data, which are used to provide redundancy, are stored locally or on remote physical storage devices such as disk drives. And in some systems, this data can be segmented into different pieces or packets and stored locally or remotely on physical mediums of storage. Use of fixed physical data storage devices add cost, maintenance, management and generate a fixed physical record of the data, whether or not that is the desire of the user 42.

The present invention does not use fixed physical data storage to store data and does not use physical data storage to provide data redundancy. When a request to store data is received by the central server 14 from storage process 100, data is directed to a node in the system 10 where it is then continuously forwarded from node memory to node memory in the system 10 by the data forwarding process 200 in each of the network nodes without storing on any physical storage medium such as a disk drive. The request to store data makes at least one copy of the data, which is directed to a node in a secondary private or public network, or directed to nodes on more than one network, where it too is continuously forwarded from node memory to node memory in the secondary private or public network. The forwarded data resides only for a very brief period of time in the memory of any one node in the system 10. Data and copies of data are not stored on any physical storage medium in any network node.

When a request to retrieve data is received by the central server 14 from storage process 100, the requested data, which is being forwarded from node memory to node memory in the system 10, is retrieved.

Data forwarded in this manner can be segmented and segments forwarded as described above. Still, the segmented data is not stored on any physical storage medium in any network node, but merely forwarded from the memory of one node to the memory of another node.

As shown in FIG. 4, storage process 100 includes sending (102) a request to a central server 14 to store or retrieve data. If the request is a retrieve data request, storage process 100 receives the requested data from the central server 14 or node in the network.

If the request to the central server 14 is a store data request, storage process 100 receives (104) first address of a node and a second address of a node from the central server 14 and forwards (106) the data to the node memory represented by the received first address and a copy of the data to the node memory represented by the received second address.

As shown in FIG. 5, data forwarding process 200 includes receiving (202) a request from a source system in a first network to store data.

Process 200 directs (204) the data to the first computer memory in a first network and directs (206) a first copy of the data to a second computer memory in a second network. Directing (206) may be to node memories in one or more networks, both private and/or public.

Process 200 continuously forwards (208) the data from the first computer memory to other computer memories in the first network without storing on any physical storage device in the first network.

Continuously forwarding (208) includes detecting a presence of the data in memory of the specific node of the first network and forwarding the data to another computer memory of a node in the first network of interconnected computer system nodes without storing any physical storage device.

Process 200 continuously forwards (210) the first copy of the data from the second computer memory to other computer memories in the second network without storing on any physical storage device in the second network.

Continuously forwarding (210) includes detecting a presence of the first copy of data in memory of the specific node of the second network, and forwarding the first copy of the data to another computer memory of a node in the second network of interconnected computer system nodes without storing any physical storage device.

In one specific example, at the point of entry to a node, data undergoes an encrypted “handshake” with the node or central server 14 or user. This can be a public or private encryption system, such as the Cashmere system, which can use public-private keys. Cashmere decouples the encrypted forwarding path and message payload, which improves the performance as the source only needs to perform a single public key encryption on each message that uses the destination's unique public key. This has the benefit that only the true destination node will be able to decrypt the message payload and not every node in the corresponding relay group. Cashmere provides the capability that the destination can send anonymous reply messages without knowing the source's identity. This is done in a similar way, where the source creates a reply path and encrypts it in a similar manner as the forwarding path.

In another example, other routing schemes are utilized.

New nodes and node states may be added and/or deleted from the system 10 based upon performance. Users may have access to all nodes or may be segmented to certain nodes or “node states” by the central server(s) or via the specific architecture of the private, public or private-public network.

Individual nodes, nodes states and supernodes may also be extranet peers, wireless network peers, satellite peered nodes, Wi-Fi peered nodes, broadband networks, and so forth, in public or private networks. Peered nodes or users may be used as routing participants in the system 10 from any valid peer point with the same security systems employed, as well as custom solutions suitable for the rigors of specific deployments, such as wireless encryption schemes for wireless peers, and so forth.

In process 200, rather than have data cached or held in remote servers, hard drives or other fixed storage medium, the data and copies of data are passed, routed, forwarded from node memory to node memory. The data and copies of data are never downloaded until the authorized user calls for the data. A user on the system may authorize more than one user to have access to the data.

A primary goal in process 200 is to generate a redundant data storage and management system where the redundant data is never fixed in physical storage, but in fact, is continually being routed/forwarded from node memory to node memory. The path of the nodes to which redundant data is forwarded may also be altered by the central server 14 to adjust for system capacities and to eliminate redundant paths of data that may weaken the security of the network due to the increased probability of data path without this feature.

The invention can be implemented to realize one or more of the following advantages. One or more networks create redundant data storage without caching or downloads. Redundant data storage and management are accomplished via a constant routing of the redundant data.

Embodiments of the invention can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. Embodiments of the invention can be implemented as a computer program product, i.e., a computer program tangibly embodied in an information carrier, e.g., in a machine readable storage device or in a propagated signal, for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple computers. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.

Method steps of embodiments of the invention can be performed by one or more programmable processors executing a computer program to perform functions of the invention by operating on input data and generating output. Method steps can also be performed by, and apparatus of the invention can be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non volatile memory, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in special purpose logic circuitry.

It is to be understood that the foregoing description is intended to illustrate and not to limit the scope of the invention, which is defined by the scope of the appended claims. Other embodiments are within the scope of the following claims. 

What is claimed is:
 1. A computerized method comprising: receiving a request from a source node to store data, wherein the source node is outside of a first network of interconnected nodes that are configured to store one or more portions of the data and the source node is outside of a second network of interconnected nodes that are configured to store one or more portions of the data, each of the interconnected nodes in the first network and the second network comprising a memory; directing at least a portion of the data to a first memory of a first node in the first network; directing a first copy of the portion of the data to a second memory of a second node in the second network; intermittently forwarding the portion of the data among the first memory and memories of other nodes in the first network without storing the portion of the data on any physical storage device of the interconnected nodes in the first network, wherein the interconnected nodes in the first network to which the portion of the data is forwarded are determined dynamically based at least in part on status of one or more of the interconnected nodes in the first network or previous paths used for forwarding one or more portions of the data; and intermittently forwarding the first copy of the portion of the data among the second memory and memories of other nodes in the second network without storing the first copy of the portion of the data on any physical storage device of the interconnected nodes in the second network, wherein the interconnected nodes in the second network to which the first copy of the portion of the data is forwarded are determined dynamically, wherein physical storage devices include one or more of hard disks, magnetic disks, magnetic tape, magneto optical disks, or optical disks.
 2. The computerized method of claim 1 further comprising: directing a second copy of the portion of the data to a third memory of a third node in a third network of interconnected nodes; and intermittently forwarding the second copy of the portion of the data among the third memory and memories of other nodes in the third network without storing the second copy of the data on any physical storage device of the interconnected nodes in the third network.
 3. The computerized method of claim 1, wherein each of the networks comprise one or more of a private network and a public network.
 4. The computerized method of claim 1 wherein at least one of said intermittently forwarding of the portion of the data and said intermittently forwarding of the first copy of the portion of the data comprises: determining a first address of a first available node in the first network to receive the portion of the data based on one or more first factors; determining a second address of a second available node in the second network to receive the first copy of the portion of the data based on one or more second factors; sending a first message to a first specific node of the first network associated with memory that contains the portion of the data, the first message comprising the first address of the first available node and a request to forward the portion of the data; and sending a second message to a second specific node of the second network associated with memory that contains the first copy of the portion of the data, the second message comprising the second address of the second available node and a request to forward the first copy of the portion of the data.
 5. The computerized method of claim 4 wherein the one or more first factors and the one or more second factors comprise network traffic analysis and available memory.
 6. The computerized method of claim 1 wherein at least one of said intermittently forwarding of the portion of the data and said intermittently forwarding of the first copy of the portion of the data comprises: detecting presence of the portion of the data in a first specific memory of a first specific node of the first network; forwarding the portion of the data to a second specific memory of a second specific node in the first network without storing the portion of the data on any physical storage device of the interconnected nodes of the first network; detecting presence of the first copy of the portion of the data in a third specific memory of a third specific node of the second network; and forwarding the first copy of the portion of the data to a fourth specific memory of a fourth node in the second network without storing the first copy of the portion of the data on any physical storage device of the interconnected nodes of the second network.
 7. The computerized method of claim 6 further comprising: associating one or more specific users with the portion of the data; and retrieving the portion of the data in response to receiving a retrieval request for the data from one of the specific users.
 8. A non-transitory computer readable medium configured to store software code that is readable by a computing system having one or more processors, wherein the software code is executable on the computing system in order to cause the computing system to perform operations comprising: receiving a request to store data from a source node that is not part of a first network of interconnected nodes or a second network of interconnected nodes; intermittently forwarding at least a portion of the data between memories of the interconnected nodes in the first network without storing the portion of the data on any physical storage device of the interconnected nodes in the first network, wherein the interconnected nodes in the first network to which the portion of the data is forwarded are determined dynamically based at least in part on status of one or more of the interconnected nodes in the first network or previous paths used for forwarding one or more portions of the data; and intermittently forwarding a first copy of the portion of the data between memories of the interconnected nodes in the second network without storing the first copy of the portion of the data on any physical storage device of the interconnected nodes in the second network, wherein the interconnected nodes in the second network to which the first copy of the portion of the data is forwarded are determined dynamically, wherein physical storage devices include one or more of hard disks, magnetic disks, magnetic tape, magneto optical disks, or optical disks.
 9. The non-transitory computer readable medium of claim 8 wherein the operations further comprise intermittently forwarding a second copy of the portion of the data between memories of the interconnected nodes in a third network without storing the second copy of the portion of the data on any physical storage device of the interconnected nodes in the third network.
 10. The non-transitory computer readable medium of claim 8 wherein at least one of said intermittently forwarding of the portion of the data and said intermittently forwarding of the first copy of the portion of the data comprises: determining a first address of a first node in the first network that is available to receive the portion of the data, wherein the determination is based on one or more factors; sending a message to a specific node that is associated with memory that contains the portion of the data, the message comprising the address of the first node and a request to forward the portion of the data to the first node; and applying a time stamp to the portion of the data in memory of the first node in the first network.
 11. The non-transitory computer readable medium of claim 10 wherein the one or more factors comprise network traffic analysis and available memory.
 12. The non-transitory computer readable medium of claim 10 wherein at least one of said intermittently forwarding the data of the portion of the data and said intermittently forwarding of the first copy of the portion of the data further comprises: detecting a presence of the portion of the data in a memory of the specific node; and forwarding the portion of the data to memory of the first node in the first network from the memory of the specific node of the first network that contains the portion of the data without storing the portion of the data on any physical storage device of the interconnected nodes of the first network.
 13. The non-transitory computer readable medium of claim 12 wherein the operations further comprise: associating one or more specific users with the portion of the data; and retrieving the portion of the data in response to receiving a request for the data from one of the specific users.
 14. The non-transitory computer readable medium of claim 10 wherein the first copy of the portion of the data is sent to a private network selected by a user or user application.
 15. A system comprising: at least two networks of interconnected nodes, each node comprising a memory and each node being configured to receive one or more portions of data or a copy of one or more portions of the data from other memories of nodes within the respective network, wherein each of the interconnected nodes is further configured to intermittently forward the portion of the data or the copy of the portion of the data among the memories of the interconnected nodes of the respective network without storing the portion of the data or the copy of the portion of the data on any physical storage device associated with the interconnected nodes of the respective network, and wherein the interconnected nodes to which the portion of the data or the copy of the portion of the data is forwarded are determined dynamically based at least in part on status of one or more of the interconnected nodes or previous paths used for forwarding one or more portions of the data or a copy of one or more portions of the data, wherein physical storage devices include one or more of hard disks, magnetic disks, magnetic tape, magneto optical disks, or optical disks.
 16. The system of claim 15 wherein at least one node of the interconnected nodes is adapted to encrypt the portion of the data or the copy of the portion of the data.
 17. The system of claim 15 wherein at least one node of the interconnected nodes is adapted to intermittently forward a second copy of the portion of the data between memories of the interconnected nodes in a third network without storing the second copy of the portion of the data on any physical storage device of the interconnected nodes in the third network.
 18. A computer system comprising: a computer memory; at least one network interface configured to allow the computer system to communicate with two or more networks of interconnected nodes, each of the interconnected nodes in the two or more networks comprising a memory; and a processor configured to: direct at least a portion of data to a first memory of a node in a first network from a source node, wherein the source node is not in the two or more networks; and initiate intermittent forwarding of the portion of the data among the first memory and other memories of interconnected nodes of the first network without storing the portion of the data on any physical storage device of the interconnected nodes in the first network, wherein the interconnected nodes in the first network to which the portion of the data is forwarded are determined dynamically based at least in part on status of one or more of the interconnected nodes in the first network or previous paths used for forwarding one or more portions of the data, wherein physical storage devices include one or more of hard disks, magnetic disks, magnetic tape, magneto optical disks, or optical disks.
 19. The computer system of claim 18, wherein the processor of the computer system is further configured to: detect a presence of the portion of the data in a first specific memory of a first specific node in the first network; determine based on one or more factors an address of a second memory of a second node in the first network, wherein the second memory is available to receive the portion of the data; and send a message to the first specific node in the first network, the message comprising an address of the second memory and a request to forward the portion of the data to the second memory.
 20. The computer system of claim 19, wherein the processor is further configured to: associate one or more specific users with the portion of the data; and retrieve the portion of the data in response to receiving a request for the data from one of the specific users. 